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ROBUST  DISTRIBUTED  BLOCK  AND  SEQUENTIAL  CONTINUOUS-TIME  DETECTION 


Evaggelos  Geraniotis 

Department  of  Electrical  Engineering 
and  Systems  Research  Center 
University  of  Maryland 
College  Park,  MD  20742 

ABSTRACT 

Two  detectors  making  independent  observations  which  are  the  outputs  of  stochastic  dynam¬ 
ical  systems  driven  by  colored  Gaussian  noise  must  decide  which  one  of  two  hypotheses  is  true. 
Detection  with  a  fixed  observation  interval  (block  detection)  and  sequential  detection  are  con¬ 
sidered.  The  decisions  are  coupled  through  a  common  cost  function  which  for  tests  with  a  fixed 
observation  interval  consists  of  the  sum  of  the  error  probabilities  while  for  sequential  tests  it 
comprises  the  sum  of  the  error  probabilities  and  the  expected  stopping  times.  For  the  case  of 
block  detection  the  time-varying  parameters  of  the  dynamical  system  belong  to  uncertainty 
classes  determined  by  2-alternating  capacities  or  to  classes  with  minimal  and  maximal  elements. 
For  the  case  of  sequential  detection  the  time-invariant  parameters  of  the  dynamical  system 
belong  to  classes  with  minimal  and  maximal  elements. 

A  minimax  robust  (worst-case)  design  is  pursued  according  to  which  the  two  detectors 
employ  tests  with  a  fixed  observation  interval  or  sequential  probability  ratio  tests  whose  likeli¬ 
hood  ratios  and  thresholds  depend  on  the  least-favorable  parameters  over  the  uncertainty  class. 
For  the  aforementioned  cost  function  the  optimal  thresholds  of  the  two  detectors  turn  out  to  be 
coupled.  It  is  shown  that,  despite  the  uncertainty,  the  two  detectors  are  thus  guaranteed  a 
minimum  level  of  acceptable  performance. 


This  research  was  supported  in  part  by  the  Systems  Research  Center  at  the  University  of  Maryland,  College  Park,  through  Na¬ 
tional  Science  Foundation  CDR-85-00108  and  in  part  by  the  Office  of  Naval  Research  under  contract  N00014-86-K-0013. 
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I.  INTRODUCTION 

In  [l]  and  [2]  distributed  discrete-time  fixed-sample-size  (block)  detection  and 
sequential  detection  problems,  respectively,  were  formulated  and  solved.  Continuous¬ 
time  distributed  detection  problems  with  known  statistics  are  considered  in  [3],  The  two 
detectors  collect  independent  observations  and  make  decisions  which  are  coupled 
through  a  common  cost  function.  Then,  the  optimal  decisions  are  characterized  by 
thresholds  which  are  coupled.  The  hypothesis  testing  models  considered  in  [1],  [2],  and 
[3]  assume  perfect  knowledge  of  the  statistics  of  the  observations. 

In  this  paper  we  investigate  distributed  detection  with  a  fixed  observation  interval 
and  sequential  detection  in  continuous  time.  The  observations  are  the  output  of  a  sto¬ 
chastic  dynamical  system  driven  by  colored  Gaussian  noise.  Due  to  modeling  uncertain¬ 
ties  or  partially  known  noise  characteristics,  the  parameters  of  the  stochastic  differential 
equation  are  only  known  to  belong  to  uncertainty  classes.  Similar  distributed  discrete¬ 
time  problems  with  statistical  uncertainty  are  treated  in  [4],  the  companion  to  this 
paper. 

Two  distinct  types  of  uncertainty  classes  are  considered:  (i)  classes  determined  by 
two  alternating  capacities  and  (ii)  classes  with  minimal  or  maximal  elements.  These 
uncertainty  models  have  been  very  popular  among  the  robust  statisticians  [5]- [8]  because 
they  include  several  useful  models  of  uncertainty  and  result  in  closed  form  expressions 
for  the  least-favorable  elements  in  the  class. 

The  design  philosophy  that  we  pursue  is  that  of  minimax  robustness.  According  to 
it,  the  worst-case  operational  conditions  are  identified  with  regard  to  the  error  probabili¬ 
ties  of  decision  designs  and  the  optimal  such  design  for  these  conditions  is  derived.  Sub¬ 
sequently  this  decision  design  is  employed  independently  of  actual  conditions  (which  are 
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not  known,  except  for  the  fact  that  they  belong  to  structured  uncertainty  classes  of  the 
types  (i)  and  (ii)  above)  and  it  is  shown  that  it  achieves  desirable  performance  despite 
the  uncertainty.  Minimax  robust  signal  processing  techniques  have  received  considerable 
attention  in  the  past  fifteen  years  (see  the  tutorial  in  [9]). 

In  this  paper,  we  first  consider  distributed  detection  schemes  with  fixed  observation 
interval  (block  detection)  and  observations  which  are  the  output  of  two  stochastic 
dynamical  systems  (driven  by  mutually  independent  Wiener  processes)  whose  time- 
varying  parameters  (means  and/or  variances)  belong  to  either  2-alternating  capacity 
classes  or  to  classes  with  maximal  or  minimal  elements.  After  the  system  and  uncertainty 
models  are  introduced,  the  least  favorable  operating  conditions,  as  well  as  the  minimax 
robust  likelihood  ratio  tests  which  are  coupled  through  their  thresholds,  are  derived.  We 
also  examine  the  behavior  of  the  Chernoff  upper  bounds  on  the  joint  cost  function  of  the 
test  in  the  presence  of  uncertainty  and  show  that  the  exponential  convergence  to  zero  (as 
the  duration  of  the  observation  interval  increases)  is  guaranteed  for  statistical  uncer¬ 
tainty  within  the  aforementioned  classes.  These  problems  are  treated  in  Section  II  of  the 
paper. 

In  Section  III  sequential  tests  are  employed.  In  this  case  we  derive  the  least  favor¬ 
able  operational  conditions  and  the  minimax  robust  sequential  probability  ratio  test  for 
two  stochastic  dynamical  systems,  whose  parameters  belong  to  classes  with  minimal  or 
maximal  elements.  The  joint  performance  criterion  now  includes-besides  the  error  pro¬ 
babilities  of  the  tests— the  expected  values  of  the  stopping  time  of  each  decision  maker 
under  the  two  hypotheses.  The  sum  of  the  asymptotic  speeds  of  the  two  sequential  tests 
which  are  inversly  proportional  to  the  informational  divergence  (Kullback-Leibler  dis¬ 
tance)  is  also  robustified. 
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n.  ROBUST  DISTRIBUTED  CONTINUOUS-TIME  BLOCK  DETECTION 


ILA  Problem  Formulation  and  Models  of  Uncertainty 

Consider  the  following  hypothesis  testing  problem  of  two  simple  hypotheses  H0  and 
Hl  with  two  decision-makers.  Decision-maker  i  (i  =  1,  2)  is  equipped  with  a  sensor 
and  is  faced  with  testing  the  hypotheses  Hi  versus  H0: 


Hi  :  dYtji  =  Mu{  dt  )  +  dWt  i 

H0  :  dYtii  =  Moi  (  dt  )  +  dWt>i  ,  0  <t<T  (1) 


In  (l)  Mjj  for  j  =  0,  1  and  i  =  1,  2  are  measures— defined  on  the  sample  space 
D  =  [0,  J1]  and  the  associated  <r-field  R—which  belong  to  classes  of  the  form 


Mjti  =  |  Mjti  G  M  I  Mj  i  (A  )<  Vj  {  (A  ),VA  eB,  Mjti  (Q)  =  vJfi  (0)|,  (2) 

where  M  is  the  class  of  measures  on  The  non-standard  Wiener  process  Wt  ,■  are 

mutually  independent  for  i  —  1,  2,  have  zero  means  and  variances 


£{  [i  dW<.<  ]  /  = 


=  E,  (A),  V  A  E  B  . 


(3) 


The  measures  E,-  belong  to  uncertainty  classes  of  the  form 


( 


E,-  =  ^  E  ,•  <E  M  |  E,-  {A  )  <  Vf  {A  ),  VA  eB,  E,-  (O)  =  n,(D) 


(4) 


The  quantities  Vjj  in  (2)  and  ut-  in  (4)  are  2-alternating  capacities  and  will  be  defined 
below. 


The  decision  making  of  detectors  1  and  2  is  coupled  through  the  following  cost 


structure: 
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C ( d hd2;h  ) 


0  for  ^1  —  ^2  —  1 
e  for  d^d 2 
/  for  d1=  d2=£h 


(5) 


where  dltd2,h  €{0,l},  e  and  /  are  non-negative  constants,  and  we  assume  that 
/  >  2e  .  Since  the  cost  [<7(1,1;0)  =  C*  (0,0;  1)]  of  wrong  decisions  by  both  detectors  is 
expected  to  be  considerably  larger  than  the  cost 

[(7(0, 1;0)  =  <7(1, 0;0)  =  <7(0,l;l)  =  C^TO;!)]  of  a  wrong  decision  by  one  of  the  detec¬ 
tors,  this  assumption  does  not  impose  a  serious  restriction  on  the  generality  of  our  prob¬ 
lem  formulation. 


Next  we  define  the  2-alternating  capacities: 

Definition:  A  positive  set  function  v  on  a  sample  space  fl  and  assosciated  <r-field  B  is 
called  a  2-alternating  capacity  if  it  is  increasing,  continuous  from  below,  continuous 
from  above  on  closed  sets,  and  satisfies  the  conditions  v  (  <f>  )  =  0, 
v(A  \J  B  )  +  v  (  A  fl  B  )  <  v  (  A  )  +  v  (  B  ).  Suppose  now  that  M  is  the  class 
of  measures  on  (  Q  ,  B  )  and  m  £  M  is  any  such  measure.  Consider  the  uncertainty 
class  which  is  determined  by  the  2-alternating  capacity  v  as  follows  [compare  with  (2) 
and  (4)]: 

Mv  =  |  m  £  A/  |  nz(A)<v(A)  ,  V  A  €  B  ,m(n)  =  v(fl)^.  (6) 

When  fi  is  compact  several  popular  uncertainty  models  like  e-contaminated  neighbor¬ 
hoods  [5],  total  variation  neighborhoods  [5],  band  classes  [6]  and  p-point  classes  [7]  are 
special  cases  of  this  model. 

Example:  The  e-contaminated  model  [5] 
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Mt  —  |  m  E  M  |  m  (A  )  =  (1  -  e)  m°(A  )  +  efn(A),  V A  E  B ,  m°(0)  =  m  (7) 

for  e  E  [  0,  1  ].  Then  v  (A  )  —  (  1  -  e  )  m°(  A  )  +  e  m°  (  f2  ) 

Fundamental  properties  of  these  uncertainty  models  have  been  studied  by  Huber 
and  Strassen  [8],  We  will  state  the  relevant  properties  as  a  Lemma. 


Lemma  1:  Suppose  v0  and  are  2- alternating  capacities  on  (  Q  ,  B  )  and  M0  and 
are  the  uncertainty  classes  determined  by  them  as  in  (1).  Then  there  exists  a  Lebesgue- 
measurable  function  7r„  :  f2  — +  [  0,  oo  ]  such  that 


6  v0  ({irv 

>*})  + 

#i  ({»# 

VI 

<  6  v0(A  )  +  u1(Ac  ) 

(8) 

for 

all  A  E  B 

and  all 

e  >  o. 

Furthermore  there  exist  measures  (  m0  , 

,  rhi  )  in 

M0 

X  Mi  such  that 

m0  (  { 

>«}) 

=  u0(  { 

> 

0}) 

(9) 

mx  (  {  tt„ 

<*>) 

=  M  { 

VI 

A 

*>) 

(10) 

(that  is,  7r„  is  stochastically  largest  over  M0  under  m0  and  stochastically  smallest  over 
Mi  under  rhi)  and  nv  is  a  version  of  drhi/drh0  and  is  unique  a.e.  [  m0  ].  The  measures 
(  m0  ,  m  j  )  are  termed  the  least-favorable  measures  over  M0XMx. 


Example:  The  e-con taminated  mixture  uncertainty  classes  described  by 

Mj  =  |  my  E  M  \  my  =  (1  -  ey)  my°  +  ey  m;-  ,  ihy(fi)  =  m°(0)  j  ,  j  =  0,  1  (11) 

associated  with  the  2-alternating  capacities 

J(l-ey  )my°U  )  +  ey  ,  A  <j> 

vi  (  A  )  =  \  0  ,  A  =  <t>  ’ 


(12) 
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have  the  least-favorable  distributions 


dm  0/d  \  — 


(  1  -  e0  )  dm®  /d  X  , 

- —  dmj°/dX  , 

c  2 


dm  i  /dm  0°  <  c  2 
c2  <  dm®  /dm0° 


dm  j/d  X  — 


(  1  -  €j  )  dm  0  /d  X  , 
c  i  (  1  —  ej  )  cfm  o*  / X 


c  i  <  dm  i  /dm0° 
dm®  /dm  §  <  Cj 


(13) 


(14) 


and  the  Huber-Strassen  derivative  irv 


nv  =  dm1/dm0 


1  -  Cl 
1  ~  e0 


min  |  c2  ,  max  (  Cj  ,  dm  ®  /dm0°  ) 


) 


(15) 


where  O<CJ<C2<00  are  such  that  m1(Q)  =  m0(ff)  =  l. 

For  the  uncertainty  classes  in  (2)  and  (4)  we  assume  that  the  nominal  measures 
satisfy  My0,-  «  X  and  S°  «  X  (i.e.,  they  are  absolutely  continuous  with  respect  to  X, 
the  Lebesgue  measure  on  U  =  [0,  T"  ])  so  that  the  least-favorable  measures  My  ,-  «  X 
and  E,-  «  X  and  we  define  Ay,«(0  =  (dMy,- /d\)(t )  and  [£,•(<  )]2  =  (d  E,- /d  X)(f ). 


We  also  consider  the  hypothesis  testing  problem 


H  j  :  dE,,.-  =  Mi,,(0  dt  +  cr,-(t )  dWt  ,- 

/f0  ;  dE,  ,-  =  noi(t)  dt  +  a{{t)  dWt  {  ,0<t  <  T  (16) 

where  Wt  ,-  are  mutually  idependent  standard  Wiener  processes  (*  =1,  2), 

Hj'i(t)  ( j  —  0,  1  ,  i  =  0,  1  )  belong  to  the  following  classes  with  minimal  or  maxi¬ 
mal  elements. 


Mii  =  {  Mi  ,i(t )  |  Mi,<  (« )  >  h,i  (0,  0  <  * 


<rj 


(17) 
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M0,i  =  |/io,,  (0  I  Mo.i(t)  <  Ao,,  (0.  0  <  t  <r|  ,  (18) 

and  (Ti  ( t )  (i  =1,2)  belong  to  the  classes 

£,•  =  |  MO  I  *»(0  <  MO . 0  <  t  <T  |  (19) 

For  either  the  hypothesis  testing  problem  (1)  or  that  of  (16)  we  assume  that  the  a 
priori  probabilities  for  the  hypotheses  H0  and  H1  are  X  and  1-X,  respectively,  and  that 
likelihood  ratio  tests  are  employed,  the  average  cost  is 

J (L  hL  2~ViJ)2)  =  Me  [*»o,i({Ix>fli})  +  ™0'2({L2>~t)2})} 

+  (/  -2e)m0il{{L1>v1})-m0i2{{L2>~T)i})} 

+  (1-X){e  [m  hl({L  !<'»?!})  +  mli2{{L2<~r]2})} 

+  (/  -2e)mu({L|n)  <Vi})  m  12{{L 2<~ri2})}  (20) 

In  (20)  rrij  are  the  probability  measures  induced  by  the  Wiener  processes  of  (1)  [or  (16)] 
of  the  »-th  decision-maker  (i  =  1,2)  under  hypothesis  Hj  ( j  =  0,1).  By 
Li  —  dm^  i/drriQi  we  denote  the  likelihood  ratio  based  on  observations  of  the  *  -th 
decision-maker  over  the  interval  [0,T]  and  by  ~r}f  its  threshold. 

The  optimal  thresholds  for  (20)  are  the  pair  (»?i,»?2)  which  minimizes  the  average 
cost  function  J(LhL 2~Vi~V2)>  that  is 


=  ar9  min  J(L  hL  2~t)1~t]2)  (21) 

Actually  the  likelihood  ratio  tests  (LRTs)  are  the  optimal  policies  for  the  two- 


decision-maker  problem  formulated  above  as  stated  in  the  following  proposition 


Proposition  1:  Likelihood  ratio  tests  (LRTs)  with  thresholds  which  minimize 
J(L  i,L  2SI1JI2)  of  (20)  are  optimal  over  all  tests  for  the  aforementioned  common  cost 
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structure 

Proof:  The  proof  follows  closely  the  corresponding  proof  of  [l]  about  the  optimality  of 
the  one-detector  strategy  (i.e.,  the  likelihood  ratio  test)  in  this  case  of  decision  makers 
with  independent  observations,  and  will  be  ommitted. 

n.B  Robust  Distributed  Block  Detection 

The  expression  for  the  average  cost  function  in  (20)  is  valid  for  the  case  that  there 
is  no  uncertainty  in  the  statistics  of  the  observations  of  the  two  decision  makers.  In  the 
presence  of  uncertainty  within  the  2- alternating  classes  My  ,-  and  £,-  of  (2)  and  (4)  [or 
the  classes  with  maximal  or  minimal  elements  of  (17)-(19)],  the  likelihood  ratios  L,-  and 
the  thresholds  ,  i  =  1,  2,  which  are  matched  to  the  least-favorable  mean  and  variance 
measures  My,-  and  £,-  (singled  out  by  Lemma  1)  of  the  classes  My  ,-  of  (2)  and  £,-  of  (4), 
respectively,  are  employed.  Similarly  in  the  case  of  the  uncertainty  classes  of  (17)-(19). 
In  these  cases  the  average  cost  function  under  mismatch— that  is,  when  the  statis¬ 
tics  of  the  observations  are  actually  governed  by  the  probability  measures  roy  ,-  induced 
by  the  Wiener  processes  with  the  means  and  variances  above— is  given  by  J (L  VL  2>Vi>V2) 
which  is  obtained  from  (20),  if  we  replace  X,-  by  Li  and  ?/,-  by  t),-  ,  for  i  —  1,  2,  and 
these  thresholds  are  the  solution  to  the  minimization  problem: 

(vvh)  =  ar9  min  /(L  ^L^M  >  (22) 

VitVi 

where  J  (L  hL  2~Vv1l2)  *s  the  average  cost  when  the  likelihood  ratios  L,-  (i  —  1,  2  for  the 
two  detectors)  are  employed  and  the  observations  are  distributed  according  to  the  proba¬ 
bility  measures  rhy  ,-  induced  by  the  Wiener  process  with  the  aforementioned  least- 
favorable  means  and  variances. 
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From  the  results  presented  in  [10]  we  cite  the  following  Lemma  for  a  single  detector 
(say  the  t-th  detector  i  =  1,  or  2) 


Lemma  2:  For  either  the  hypothesis  testing  problem  (1)  [for  the  t-th  detector]  and  the 
uncertainty  models  (2)  and  (4),  or  the  problem  of  (16)  and  the  uncertainty  models  of 
(17)-(19),  the  likelihood  ratio  test  based  on 


InLi 


c  Ai,.-(0  Ao,»(0  l  r 

Jn  ]d,(0]2 


[AM(0f  -  [Ao,.{QP 

MO]2 


dt , 


(23) 


is  a  minimax  robust  test;  i.e. ,  the  error  probabilities  for  this  test  under  the  two 
hypotheses  satisfy  the  inequalities 


>  Vi})  <  rh0}i{{ti  >  ii,})  <  m0ii  ({(?,•  >  Vi }) 

«i, »({A-  <  m})  <  *«i ,<({A-  <»?»})<  <  »?»•}) 

for  all  thresholds  ?/,•  satisfying 

|  In  ru  I  <  If  [Am(0-A..,(0]2  dl 
1  1  -  2Jn  [a-,(0]2 


(24) 

(25) 


In  (24)-(25)  rhj  i  are  the  measures  induced  by  the  Wiener  processes  of  (1)  or  (16)  when 
the  parameters  involved  in  these  equations  are  the  least-favorable  elements  of  the 
corresponding  uncertainty  classes. 

Proof:  See  Propositions  3  and  6  of  [10]. 

The  main  result  of  this  section  now  follows: 


Proposition  2:  The  LRTs  of  the  two  detectors  based  on  the  log-likelihood-ratio  of  (23) 
for  i  =  1,  2  are  minimax  robust  with  respect  to  the  average  cost  function  defined  in 
(20),  for  the  problems  (1)  and  (16)  with  parameters  in  the  uncertainty  classes  (2)  and  (4) 
or  (17)-(19),  respectively,  that  is 
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J {L  i,L iiVvVi)  5;  i>L  21^1^2)  <  J  (G \>G  <brh>rl,2)  (26) 

where  (5,-  (2  =  1,  2)  are  any  decision  statistics  operating  on  the  outputs  of  the  dynami¬ 
cal  systems  of  (1)  or  (16). 

Proof:  The  right-hand-side  inequality  in  (26)  is  a  straightforward  application  of  Propo¬ 
sition  1  to  the  case  characterized  by  rft  ,•  ( j  =  0,1  and  2  —  1,  2)  for  which  the  likeli¬ 
hood  ratios  are  L{  and  the  optimal  thresholds  are  27,- . 

The  left-hand-side  inequality  in  (26)  is  a  consequence  of  Lemma  2.  Specifically  we 
apply  the  left-hand-side  inequalities  in  (24)  and  (25)  to  the  probability  measures 
{moil  m0i )  and  (my,  my),  respectively,  of  the  two  detectors  (1  =  1,  2),  and  then 
use  the  definitons  of  the  mismatch  average  cost  function  J  and  the  average  cost  function 
J  matched  to  the  pair  of  probability  measures  (moi,m  1,)  induced  by  the  least- 
favorable  parameters  of  the  classes  (2)  and  (4)  or  (17)-(19). 

Note:  The  optimal  thresholds  (2)1,272)  can  be  determined  from  the  error  probabilities 
d,-  ft  (f—1,2)  for  the  least-favorable  case  of  problem  (1)  by  minimizing 

min  j  X  [e  (oq  +  d2)  +  (/  -2e  )a1d2  j  +  (1-X)  [e  (ft  +  ft)  +  (/  ~2e  )ftft 
under  the  constraints  ft  =  /,•  (d,- )  [operating  receiver  characteristic  (ROC)  for  detector 
2  ],  0<d,-  <1,  0 <ft  <1,  and  d,-  +  ft  <1  for  *  =1,2. 

n.C  Asymptotic  Performance 

It  is  of  interest  to  examine  the  behaviour  of  the  joint  cost  function  under  the  two 
hypotheses  as  T ,  the  length  of  the  observation  interval,  increases.  In  particular,  it  is 
desirable  that  the  robust  likelihood  ratio  tests  described  in  Propositions  2  behave  asymp¬ 
totically  (for  large  T )  in  an  optimal  way, 
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For  continuous-time  problems  with  uncertain  statistics  the  asymptotic  performance 
of  the  ChernofT  bounds  on  the  error  probabilities  of  a  single  detector  (say  the  i -th  detec¬ 
tor)  with  observations  obeying  (1)  or  (16),  and  uncertainty  models  (2)  and  (4)  or  (17)- 
(19)-when  the  likelihood  ratio  test  of  (23)  is  employed-has  been  examined  in  [10].  The 
next  two  Lemmas  contain  the  results  of  [10]  in  a  condensed  form. 


Lemma  3:  The  error  probabilities  for  the  hypothesis  testing  problems  of  (l)  and  (16) 
[for  the  *  -th  detector]  can  be  upperbounded  as 

™0,,-({4-  >  71 })  <  exp{-T[s7i  +  C0ii(s  ,.£,•)]}  (27) 

and 

m  i,i  {{Li  <  li  T  })  <  exp{-T  [-s  7 i  +  Cu  (s  Xi )]}  (28) 

for  all  s  in  (0,1). 

In  (27)-(28)  the  Chernolf  distances  Cji(sXi)  f°r 
(j  =  0,  1)  and  i  —  1,  2)under  mismatch  are  defined  as 

,4)  =  “!■>[£»,(  W}]  (29) 

and 


(30) 


and  the  threshold  is  77,-  =7 ,•  7\  For  uncertainties  in  (2)  and  (4)  the  ChernofT  distances 
actually  take  the  form 


Co,i  (s  Xi )  — 


2  r  f  /*!,.-(*  H*o,i(0 


[MOP 


t 

y 


y,  /  1,  \  r  Ai,i(0  Ao,»(0  ,,  /  j,  \  |  s 

S,-  (dt)-S  JQ  M0ti  [dt  )  +  2  c; 


(31) 
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C 1  ,i  (5  A  ) 


-/  ( w  +  .  fMM lMl  ,<<«) 

H  MOP  /  4  MOP 


s 

- Ci 

2  ’ 


(32) 


where 


/. 


IMOlM/MOP 

IM(OP 


■d£. 


(33) 


For  uncertainties  in  (17)-(19)  we  need  to  replace  A/y  ,  (d£)  by  fij  i{t)dt  and  £,(d£)  by 
[cr,  (<  )}2dt ,  respectively,  for  j  =  0,  1  and  i  —  1,  2. 

Proof:  See  Proposition  4  of  [10]. 


Lemma  4:  The  Chernoff  upper  bounds  on  the  error  probabilities  of  the  hypothesis¬ 
testing  problems  (1)  and  (16)  with  the  uncertainty  models  of  (2)  and  (4)  and  (17)-(19) 
aproach  zero  exponentially  with  increasing  T --despite  the  uncertainty—when  the  likeli¬ 
hood  ratio  test  of  (23)  is  employed. 

Proof:  See  Proposition  5  of  [10], 

The  following  proposition  provides  the  desired  asymptotic  result  for  the  mismatch 
average  cost  function  /(L  i,L2>*h>*?2)  as  the  length  of  the  observations  interval  T 
increases: 

Proposition  3:  Under  the  assumptions  of  Proposition  2,  the  average  cost  function 
under  mismatch  converges  to  zero  exponentially  as  the  length  of  observations  interval  T 
increases,  despite  the  uncertainty;  that  is,  J(L  ltL  2,?h>J?2)— as  T  — ►  oo  for  all  values  of 
the  parameters  in  the  uncertainty  class  (2)  and  (4)  or  (17)-(19). 

Proof:  By  applying  Lemma  3  to  the  error  probabilities  of  the  hypothesis  testing  prob¬ 
lem  of  each  of  the  two  detectors  and  using  the  definition  of  J {L  \,L  2A1A2]  we  derive  an 
upper  bound  on  the  average  cost  under  mismatch  in  terms  of  the  Chernoff  bounds.  This 
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takes  the  form 

J {L  i,L  2,Vi,V2)  <  X{e  [exp{-T  [s  7i+Co,i(s  X 1)]}  +  exp{-T  [s  72+<?o,2(s  ,L  2)]}] 

+  (/  -2e  )exp{-T  [s  7i+<?o,i(s  £  i)]}exP{-r  Is  72+<?o,2 («  X  2)]}} 

+  (1-X){e  [exp{-T[-s  ^X+C  u(s,Li)]}  +  exp{-T  [-s  l2+^i,2(s  &  2)]}] 

+  (/  -2e  )exp{-T  [-S71+C'1>1(s  ,L  i)]}exp{-T  [s l2+Ci,2{s  A)]}} 

(34) 

where  fa  —  7,-  T  is  the  threshold  for  the  *-th  detector  (i  =  1,  2).  Finally  we  apply 
Lemma  4  to  (34)  to  complete  the  proof  of  Proposition  3. 
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m.  ROBUST  DISTRIBUTED  CONTINUOUS-TIME  SEQUENTIAL 
DETECTION 

m.A  Problem  Formulation  and  Model  of  Uncertainty 

The  distributed  sequential  detection  problem  that  we  consider  in  this  section  has  a 
lot  of  sismilarities  with  the  problem  considered  in  the  previous  section.  The  two  decision 
makers  are  faced  with  the  hypothesis  testing  problem 

Hq  :  dYt  i  =  nos  dt  +  cr,  dWt  i  , 

:  dYtii  =  Hn  dt  +  a ,■  d\Vt  i  ,  0  <  t  <  rf  (35) 

where  r,-  is  the  stopping  time  [11]  of  the  i-th  detector  and  ,  fio  i  ,  0{  are  time- 
invariant  parameters  which  belong  to  the  following  uncertainty  classes  characterized  by 
their  minimal  or  maximal  elements  (for  which  /zlt-  >  /z0 ,). 


Mi  ,i  = 

|  Mi,;  1  Mi ,i  >  Mi,i  |  , 

(36) 

M0,i  = 

^  Mo,;  I  Mo i,»  ^  Mo ^  , 

(37) 

=1 

(38) 

The  cost  function  C  (y;  )  of  (5)  remains  the  same  as  in  section  II.  However,  now  there  is 
also  a  cost  for  collecting  data,  which  for  the  i-th  decision  maker  (i  =  1,  2)  is  defined 
by: 

+  (39) 

where  &,•  ( i  =  1,  2)  are  nonnegative  constants,  Ej  j  denotes  expectation  with  respect  to 
the  probability  measure  mj  ,  (under  the  hypothesis  Hj ,  j  =0,1,  and  for  the  i-th 
detector,  i  =  1,  2)  induced  by  the  Wiener  process  Wt  i  of  (35),  the  a  priori  probabilities 
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for  the  hypotheses  H0  and  Hx  are  X  and  1— X,  respectively,  and  the  random  variable  r,-  is 
the  stopping  time  of  the  t'-th  detector;  i.e. ,  the  necessary  length  of  the  observation  inter¬ 
val  in  order  to  reach  a  decision  in  favor  of  one  of  the  two  hypotheses. 

Recall  [11]- [12]  that  for  a  single  sequential  detector,  the  optimal  test,  termed  the 
sequential  probability  ratio  test  (SPRT),  consists  of  keep  collecting  observations  till  the 
likelihood  ratio  L  T  based  on  the  observation  interval  [0,r]  exceeds  B  or  falls  below  A  — 
the  two  thresholds  (0<A  <1<R)~ in  which  case  a  decision  is  made  in  favor  of  Hl  or 
H0,  respectively. 

Assuming  that  SPRTs  are  employed  by  both  detectors,  we  can  write  the  average 
cost  as 

J ( L  i,L  2, A  i,B1,A  2,B 2)  =  x/ajEo^Tj  |  ^  j}  +  A2  £o,2{t2  I  L  2} 

+  e  [m(i,V  {{L  B  1})  +  m^,2>  ({-^2, 2})] 

+(/  -2e)m$({LhT>B1}ym$({L2j>~B2}) 

+(1-X)|  Aj^i  i{ri  I  Lx)  +  A2  ^i,2{r2  I  ^2 } 

+  e  [to  /  ^  {{L  i,Tl<A  1})  +  m  ({^2 ,t2<A  2})] 

+(/  -2e  )m  IV  ({L  hr<A  Jjm  {{L  2}T<A  2}) 

(40) 

where  r,-  (1  =  1,  2)  are  stopping  times  for  the  two  detectors,  that  is,  if  the  likelihood 
ratio  Li  r,  which  is  based  on  observations  in  the  interval  [0,r],  is  larger  than  or  equal  to 
Bi ,  it  is  decided  that  Hi  is  true,  the  test  terminates  and  r,-  —  r,  if  it  is  smaller  than  or 
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equal  to  Ai ,  it  is  decided  that  H0  is  true,  the  test  again  terminates  and  r,-  =  r;  other¬ 
wise,  more  observations  are  collected  ([0,r]  increases)  and  the  procedure  continues,  rrij-p 
is  the  probability  measure  which  governs  the  observations  of  the  i-th  detector  under 
hypothesis  Hj  ( j  —  0,  1)  when  the  SPRT  terminates  after  the  interval  [0,r,  ]  has  been 
processed.  The  notation  Ej  ,•  {r,-  |  Li}  has  been  preferred  over  the  notation  Ej  ,•  {t,-  }  for 
the  expected  value  of  r,-  under  the  probability  measure  m;-  ,  and  an  SPRT  employing 
the  likelihood  ratio  L,-  =  dm1(/dm0,-,  because  it  allows  us  to  consider  situations  of 
mismatch,  that  is,  when  the  likelihood  ratio  employed  is  not  the  one  corresponding  to 
the  operating  probability  measures. 

The  optimal  thresholds  for  (40)  are  the  quadruple  (A  ltB  VA  2,B  2)  which  minimizes 
the  average  cost  function  J(L1,L2,A  i,B  i,A  2,# 2)1  that  is 

(A  i,B  i,A  2,B  2)  =  arg  _  min  _  J (L  X,L  2,A  ±,B  ltA  2,B  2)  (41 

A  X,B  hA  2,B2  t  ' 

Actually  the  sequential  probability  ratio  tests  (SPRTs)  are  the  optimal  policies  for  the 
two-decision-maker  problem  formulated  above  as  stated  in  the  following  proposition 
Proposition  4:  SPRTs  with  thresholds  which  minimize  / (L  X,L  2,A  1}B  i,A  2,B  2)  of  (40) 
are  optimal  over  all  tests  for  the  aforementioned  common  cost  structure. 

Proof:  The  proof  is  provided  in  [2]  for  discrete-time  sequential  detection  and  in  [3]  for 
continuous-time  sequential  detection  and  it  establishes  the  optimality  of  the  one-detector 
strategy  (i.e.,  the  SPRT)  in  this  case  of  decision  makers  with  independent  observations. 
It  will  be  ommitted. 

UI.B  Robust  Distributed  Sequential  Detection 

The  expression  for  the  average  cost  function  in  (40)  is  valid  for  the  case  that  there 
is  no  uncertainty  in  the  statistics  of  the  observations  of  the  two  decision  makers.  In  the 
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presence  of  uncertainty  within  the  classes  of  (36)-(38),  the  likelihood  ratios  and  the 
thresholds  ( Ai,B, ),  i  =  1,  2,  which  are  matched  to  the  measures  my  induced  by  the 
least-favorable  parameters  in  the  above  classes  are  employed.  In  this  case  the  average 
cost  function  under  mismatch— that  is,  when  the  statistics  of  the  observations  are 
actually  governed  by  my  ,-  induced  by  any  parameters  in  (36)-(38)— is  given  by 
J(Ll}L  2,A  i,B i,A  2,B 2)  which  is  obtained  from  (40),  if  we  replace  L,-  by  L,-  and  (A,-,B,-) 
by  (Ai  ,B, ),  for  i  =  1,  2.  These  thresholds  are  the  solution  to  the  minimization  prob¬ 
lem: 


(A  X,B  X,A  2,B  2)  —  arg  _  min.  3  (L  x,t  2,A  hB  VA  2,B  2)  , 

1  It-A  2 


(42) 


where  J (L  ltL  2,A  ltB  ltA  %,B  2)  is  the  average  cost  when  SPRTs  based  on  the  likelihood 
ratios  L,  and  the  thresholds  (A,-  ,2?,- )  ( i  —  1,2  for  the  two  detectors)  are  employed  and 
the  observations  are  governed  by  the  probability  measure  my  ,-  which  is  induced  by  the 
least-favorable  parameters  (j  =  0,  1  for  the  two  hypotheses). 

From  [10]  we  cite  the  following  two  results  for  a  single  sequential  detector: 

Lemma  5:  Suppose  r,-  is  the  stopping  time  of  the  SPRT  associated  with  (35)  and  the 
SPRT  employs  a  likelihood  ratio 


A  A  A  2  *2 

,  f  v  /*1 

lnLi,r  =  - - Z7~2  T 

<7i  lOi 


(43) 


based  on  observations  over  [0,rj  and  thresholds  (A  ,B )  which  are  matched  to  the  case 
(  Moi  >  >°i  )  [the  least-favorable  parameters  in  the  uncertainty  classes  (36)-(38)] 

while  the  actual  operating  conditions  are  determined  [through  (35)]  by  (  fio  i  ,  f  ,  cr,-  ). 
Then  the  following  identities  are  true  for  the  error  probabilities  under  mismatch: 
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«<  =  m0 ,.'({4',r,  >  A  }) 


A,-  -  1 


(sr1^)'0"  - 1 


where 


0,i 


Vj  (  2  Mo,,-  -  Au  -  Ao,*'  ) 
Vi  (  Ai,i  -  Ao,i  ) 


and 


5,  -  1 


ft  =  ray  <  A, }),  =  .  ‘  - - 

(Vft )' -  -  1 


where 


(44) 


(45) 


(46) 


i,i 


^»2  (  2  Mu  -  Au  -  Ao,*-  ) 

Vi  (  Ai ,i  -  Ao ,»  ) 


(47) 


For  the  expected  stopping  times  under  mismatch  the  following  identities  hold: 


Eo,i  [  Ti  |  Li  )  — 


and 


E  l,*  [  Ti  \  Li  }  — 


-2  <jj  a;  (  d,- ,  /?,- ,  at-  ) 

(Ai,,  -Ao,i)  (  2  Mo,*  -  Mi,i  -  Ao,»  ) 


2  g  2  0}  (  Pi ,  «< ,  Pi  ) 

(Mu-Mo,.  )  (  2  Mu  “  Mi,,'  “  Mo,*'  ) 


where 


u  (  x ,  y ,  x  )  =  (  1  -  x  )ln  -  -  —  +  x  In  * 


1  -  V 


and 


&i  =  "1 0, ,'({£*', r,  >  5,  })  = 


1-A, 

^  a.  y 

Bi-Ai 


and 


(48) 


(49) 


(50) 


(51) 


Pi  =  n*u({A,r.  <  ^.  })  = 


i*(fi-l) 

B;-A; 


(52) 


A  A 

are  the  error  probabilities  in  the  matched  case.  The  thresholds  and  5,-  are  given  by 


Proof:  See  Lemmas  3  and  2  of  [10]. 


lemma  6:  Consider  the  hypothesis  testing  problem  of  (35)  [for  the  t-th  detector]  with 
parameters  in  the  sets  (36)-(38).  Suppose  that  the  following  assumptions  are  satisfied: 


(0  Mi.i  >  Mo,i 
,  „  1  -  & 

(«*)  In  -  »  -  Pi  In 

Oli 


Pi 


(i-&, )  ( i  -  A  ) ' 


Then  the  following  inequalities  are  true 

ai  ~~  m0,i{{L{ jt  ^  Bi })  <  ^  B{ })  =  &i  , 

Pi  =  mi,i({Li,r,  <  ^,})>  <  mu({tiiTt  <  i,  })  =  Pi  , 

and 


Ej,i  [  B  I  Ei  ]  <  Ei,i  M  Ei)  ,  j  =  0,  1 

Proof:  See  Proposition  8  of  [10]. 


(55) 

(56) 

(57) 

(58) 

(59) 


The  main  result  of  this  section  now  follows: 

Proposition  5:  The  SPRTs  which  employ  thresholds  (j4,-,J9,)  and  likelihood  ratios  Li  T 
defined  as  in  Lemma  5  [equation  (43)]  (for  the  two  detectors  i  =  1,  2)  are  minimax 
robust  with  respect  to  the  average  cost  function  defined  in  (40),  that  is 


J {L  i,L  2, A  i,B1,A  2,B2)  ^  J {L  i,L  2,A  i,A  2,B 2)  <  J  {G  i,G  2,A  i,Bi,A  2,^2) 


(60) 


where  G,-  (1  =  1,  2)  is  any  decision  statistic  operating  on  the  observations  over  the 
interval  [0,r,],  if  for  i  —1,2  a,-  and  Pi  of  (53)-(54)  satisfy  the  condition  (56). 
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Note:  The  optimal  thresholds  )  (i  =  1,  2)  can  be  determined  from  equations 

(53)-(54)  where  the  error  probabilities  a,-  and  ft  are  solutions  to  the  minimization  prob¬ 
lem: 


mm 


ufihftAi) 


+  k. 


u{dt2,p2,a2) 


E 01{-lnL  l  Tl}  E0  2{-lnL2  T2} 


+  e  (01+0:2)  +  (/  -2e  )djd2 


+  (!-x) 


i,ft) 


+  k 


u(fa,ot2,fa) 

2  "a ‘ 


+  e  (&+&)  +  (/  -2e  )ftft 


}  (61) 


under  the  constraints  0  <  d<  <  1,  0  <  ft  <  1,  and  d,-  4-  ft  <1  for  i  =  1,  2.  In  (61) 
the  quantities  in  the  denominators  satisfy  the  inequalities: 


2a  i  2a  i  ’  " } 


(62) 


E ,  ,  f/n£,  r  }  =  fio,i  )(2^l,i  /*l,t  £o,i  )  ^  (Al Mo,»  )2  a 

’  oa2  —  TT2 -  =  /il,» 


2d/ 


2d,- 


{/n4>,}  (63) 


which  are  valid  because  of  the  definition  of  the  log-likelihood  ratios  InL  T  and  of  the 

*  »Tj 

inequalities  that  define  the  uncertainty  classes  (36)-(38). 


m.C  Asymptotic  Performance 

The  following  proposition  provides  a  result  on  the  common  asymptotic  speed— 
which  is  defined  as  the  sum  of  the  asymptotic  (for  small  error  probabilities)  stopping 
times  of  the  two  detectors— of  the  robust  sequential  test. 

Proposition  6:  Suppose  that  for  the  problem  (35)  with  the  uncertainty  classes  (36)-(38) 
and  under  the  mismatch  conditions  of  Lemmas  5,  6  and  Proposition  5  above,  the  error 
probabilities  d,-  and  ft  (for  »=1,2)  approach  zero.  Then  the  sum  of  the  asymptotic 
expected  stoppong  times-under  mismatch  and  for  the  least-favorable  case-satisfy  the 
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inequalities 


-InA  i  -InA  o 

^-——^~  +  k2-  2 


E^i-lnL,}  E  oo{— /n£  2} 


+  (l~X) 


InB  0 

*  x—  T  +fc2~ 


<  X 


k  ~lnAl  !  k  ~lnA* 

+ (1-X) 

£o,lH^l}  e o,2{—inL  2} 

[klio^Tx  1  Gj}  +  2^0,2 {^2  1  ^2} 

+  (1-X) 

E  i,2{lnL  2} 

InB  1  InB  2 

— ; ,  7  ,  +  ^  2" 


£’^i{/nLi}  £’12{^n-^2} 


(64) 


where  and  G2  are  any  other  sequential  tests  different  from  the  SPRT;  Ej,i  denotes 
the  limit  of  the  expectation  Ej  {  as  on  — *•()  and  /?,•  — >0. 

Proof:  As  d,  — +0  and  /&,  — >0,  then  <*,•  and  /?,■  approach  zero  aa  well,  since  a(  <d<  and 
/?i  </?»  •  Thus  J  (L  ltL  2,A  i,B  x,A  2,^2)  (^he  joint  cost  function  under  mismatch)  reduces 
to  the  first  sum  in  (64),  whereas  J(L  1X2, A  i,BvA  2,jB2)  (the  joint  cost  function  matched 
to  the  least-favorable  case)  reduces  to  the  second  sum  in  (64).  The  first  sum  is  smaller 
than  the  second  sum  because  of  the  inequalities  (62)-(63).  The  second  inequality  in  (64) 
holds  because  of  a  theorem  by  Wald  [12]  (for  the  matched  single  detector  case)  which 
states  that  the  SPRT  has  the  minimum  asymptotic  speed  (expected  stopping  time) 
among  all  sequential  tests. 


22 


REFERENCES 

[1]  R.  R.  Tenney,  and  N.  R.  Sandell,  ’’Detection  with  distributed  sensors,”  IEEE  Tran¬ 
sactions  on  Aerospace  and  Electronic  Systems,  Vol.  AES-17,  pp.  501-510,  July  1981. 

[2]  D.  Teneketzis  and  Y.-C.  Ho,  “The  decentralized  Wald  problem,”  1985,  preprint. 

[3]  A.  LaVigna,  A.  M.  Makowski  and  J.  S.  Baras,  “A  continuous- time  distributed  ver¬ 
sion  of  Wald’ s  sequential  hypothesis  testing  problem,”  1986,  preprint. 

[4]  E.  Geraniotis,  “Robust  distributed  block  and  sequential  discrete-time  detection  in 
uncertain  environments”,  accepted  for  presentation  at  the  1987  Conference  on 
Information  Sciences  and  Systems,  Johns  Hopkins  University,  March  1987;  to 
appear  in  the  conference  proceedings. 

[5]  P.  J.  Huber,  “A  robust  version  of  the  probability  ratio  test,”  Ann.  Math.  Stat., 
Vol.  36,  pp.  1753-1758,  1965. 

[6]  S.  A.  Kassam,  “Robust  hypothesis  testing  for  bounded  classes  of  probability  densi¬ 
ties,”  IEEE  Transactions  on  Information  Theory,  vol.  IT-27,  pp.  242-247,  1981. 

[7]  K.  S.  Vastola  and  H.  V.  Poor,  “On  the  p-point  uncertainty  class,”  IEEE  Transac¬ 
tions  on  Information  Theory,  Vol.  IT-30,  pp. 374-376,  1984. 

[8]  P.  J.  Huber  and  V.  Strassen,  “Minimax  tests  and  the  Neyman-Pearson  lemma  for 
capacities,”  Ann.  Statist.,  Vol.  1,  pp.  251-265,  1973. 

[9]  S.  A.  Kassam  and  H.  V.  Poor,  “Robust  techniques  for  signal  processing,”  Proceed¬ 
ings  of  the  IEEE,  Vol.  73,  pp.  433-481,  March  1985. 

[10]  E.  Geraniotis  and  Y.  A.  Chau,  ”On  robust  continuous-time  discrimination,”  sub¬ 
mitted  for  publication  to  the  IEEE  Transactions  on  Information  Theory,  March 
1987. 

[11]  N.  Shiryayev,  Optimal  Stopping  Ruels,  New  York,  Springer-Verlag,  1977. 

[12]  A.  Wald,  Sequential  Analysis,  New  York,  Wiley,  1947. 


